Learning Prior Feature and Attention Enhanced Image Inpainting
نویسندگان
چکیده
AbstractMany recent inpainting works have achieved impressive results by leveraging Deep Neural Networks (DNNs) to model various prior information for image restoration. Unfortunately, the performance of these methods is largely limited representation ability vanilla Convolutional (CNNs) backbones. On other hand, Vision Transformers (ViT) with self-supervised pre-training shown great potential many visual recognition and object detection tasks. A natural question whether task can be greatly benefited from ViT backbone? However, it nontrivial directly replace new backbones in networks, as an inverse problem fundamentally different To this end, paper incorporates based Masked AutoEncoder (MAE) into model, which enjoys richer informative priors enhance process. Moreover, we propose use attention MAE make learn more long-distance dependencies between masked unmasked regions. Sufficient ablations been discussed about models paper. Besides, experiments on both Places2 FFHQ demonstrate effectiveness our proposed model. Codes pre-trained are released https://github.com/ewrfcas/MAE-FAR.KeywordsImage inpaintingAttentionVision transformer
منابع مشابه
Image alignment via kernelized feature learning
Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption for most of the machine learning algorithms is that the training set (source domain) and the test set (target domain) follow from the same probability distribution. However, in most of the real-world application...
متن کاملGenerative Image Inpainting with Contextual Attention
Recent deep learning based approaches have shown promising results for the challenging task of inpainting large missing regions in an image. These methods can generate visually plausible image structures and textures, but often create distorted structures or blurry textures inconsistent with surrounding areas. This is mainly due to ineffectiveness of convolutional neural networks in explicitly ...
متن کاملStudy Of Image Inpainting Based On Learning
In this paper, we construct a actual system of image inpainting based on the image inpainting system model[1] which was proposed before, in order to repair more types, more broken images in the different field and restore them more efficiently ,because there is no one “universal” or more general algorithm can repair all types of images. The proposed system integrates the commonly used several t...
متن کاملShift-Net: Image Inpainting via Deep Feature Rearrangement
Deep convolutional networks (CNNs) have exhibited their potential in image inpainting for producing plausible results. However, in most existing methods, e.g., context encoder, the missing parts are predicted by propagating the surrounding convolutional features through a fully connected layer, which intends to produce semantically plausible but blurry result. In this paper, we introduce a spec...
متن کاملEnhanced and Efficient Image Retrieval via Saliency Feature and Visual Attention
263 Abstract—In the real world applications such as landmark search, copy protection, fake image detection, partial duplicate image retrieval is very important. In the internet era users regularly upload images which are partially duplicate images on the domains like facebook, instagram and whatsapp etc. The partial image is only part of whole image, and the various kind of transformation invol...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-19784-0_18